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^ 1, Introduction , During the past year four faculty members at State Univerr^ity 

(3 in New York, participated in a state-wide, ^7-nionth, NSF supported program of 



College (Geneseo), along with about 70 faculty froa 15 other undergraduate instiiut 
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experimentation and development of computer assisted instruction (CAI) using prima, ily 
APL (A Prograjuming Language),^ At Geneseo, this program is carried out with an IBM 
27^1 communications terminal with a telephone link to an IBM/370 155 computer at 
SUNY-Binghamton, 150 miles away* APL\370 is thus a remote-terminal, tiine-sharing 
system which uses this language with IBM Sirstem/370 computers* This paper presents 
some preliminary results of APL\370 in teaching econometrics. 

In order to understand the APL\370 ^stem more fully, it is useful to recall tliat 
there are two main modes of utilizing a computer; batch processing ^ Etnd time^sharing 
processing * In batch processing, computer programs are treated as if they are in a 
queue waiting to be run* Instructions may be punched on cards (or sorue other input 
means such as a terminal), the numerical data that go with the instructions are then 
added, and the deck of cards are then placed on the card reader to wait in line for 
processing* In time-sharing mode, the computer inputs data from many users simult^ 
aneously (in contrast to batch processing in yhich one program is read in at a time), 
keeping a kind of conversation with them. The users at the terminals slowly 
interact with the terminal, building up their programs or sitt^ly using the ^stem's 
stored libraiy of programs* Since the computer is extremely fast, the users will 
Ct*^ seldo^Ti have to wait for more than a second to obtain a response from the conputer. 

The APL\ 370 ^stem is one which provides a high degree of interaction between the 
user and the terminal, and is often referred to as "an interactive'* approach to 
compu t er programming. 

^APL riesignates the programming language that is the outgrowth of the work of K. B, , 
Iverson, first at Harvard and then at IBM* The name comes from the initials of liis 
book: A Programming Language (New York: Wiley, 1962) 
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The pa^rer and versality of APL\370 may be realized by means of a wide variety 
of computer programs. While we will not attempt to classify these programs, they 
usually can be placed under headings such as ga^es . tutorial s ^. simulations , and 
Droblem^so lying pgcka^ ^s. The problem-solving programs, such as The University of 
Alberta STATPACK2 package^, which is available in the public libraries of most AFI, 
systems, and the New Plata Library of Statistical Frogran:^^, are now used routinely 
in teaching economic statistics at SUC (Geneseo), To use these packages the studunt 
copies the appropriate program from the library, types in the vector (or matrix) of 
observations tind other data, and upon hitting the return key the results of the 
statistical analysis, which will include the estimated regression coefficients and 
their standard errors in a regression program, wil7 be displayed immediately. 

2. Obstacles to Teaching Econometrics , Applied econometrics consists primarily 
in the application of regression analysis to economic data. These data are mainly 
ncn-experimental observations ^mostly in the form of economic time series-^ and contain 
well-known limitations which may make them unsuited for use by the ordinary least 
squares (OLS) estimating procedure. 

In experimental data, for whi ^'h OLS was initially intended, one attempts to obtain 
observations which satisfy the assumptions of regression theory. However, in the (jase 
of non-experimental data, one attempts to find a theoiy or model which can best utili;',*^ 
the existing data, or one attempts to evaluate how effectively a given regression 
model is able to utilize these data. Indeed, many of the problems encountered in 
teaching econometrics stem from attempts to utilize inadequate empirical data in 
regression analysis. 

How can APL\370, with its rapid response and versatility, and its facility Tor 

handling arrays, aid in the Improvement of teaching effectiveness in econometrics? 

By using the computer to simulate observations which represent real life data, and 

"by testing various least squares estimating procedures on these data (in particular 
^Smillie, K. W-, STATPACK2: AN APL STATISTICAL PACKAGE . Department of Computer 3i;ience, 

The University of Alberta. 
3Prins, J. APLi HEW PUTZ LIBRARY OF STATISTICAL PROGRAMS . 4th Ed., 1972, SX (Wew 

Paltz). 
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OLS), the teaching of empirical econometrics should be greatly Cticilitat^^ti. 

3# Simulation of Economic nbservatlons . The simulations used in the pro^-^rams 
to be discussed shortly involve stochastic processes and are called llonte Carjc^ 
simLatlons. An essential feature of these simulations is the generation of ranJoni 
numbers. Fortunately, many APL libraries contain several random number distributions 
and generation routines that draw random numbers according to certain probability 
distributions. The routines used in these programs are contained in the School of 
Advanced Teclmology (SAT) Libraiy at SUNY-Binghamton* 

The advantage of APL\370 in Monte Carlo simulation is its rapid re^iponse and 
simplicity of use, but it should be mentioned that one disadvantage of this sy:-^t«m 
is that it cannot do large simulations easily* However, for instructicnal pui^poses, 
this limitation ia not serioxis. 

TvfO programs, out of six that have been written and evaluated by a class of 16 
students in econometrics, will be discussed* These programs involve the simulation 
of a set of n observations (sample size of n) of a sitt^le regiresslon niodel 
Y ^ A+BK"^Uj the plotting of the Y and X simulated values; the estimi.tion of the 
regression coefficients A and B and their comparison with the true population coeff- 
icients; and the plotting of the estimated and true regression lines for comparative 
purposes. The remaining programs, not discussed here, consist of Monte Carlo sim- 
ulations of various multiple regression models. 

Among the objectives of these two programs are: (l) To itt^ress the :5tudent with 

the fact that a sample of observations from economic time series is, in reality, a 

random sair^'l® from some hypothetical population; (2) That the estimated regression 

line is an empirical representation of the true regression line; (3) As the size of 

sample is increased, the estimated regression coefficients approximate more closely 

the true regression coefficients; (4) To emphasize the role played by the random U 

term in regression modelsj and (5) When the error term U is autocorrelated (serially 

^he generation routines iised in these models were written by Professor Peter iJoitaf?^ 
of the School of Advanced Technology, SUNY-Binghamton# 
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corrtilated), the standard errors of the regression coefficients are ineroasod r^iative 
to those of the non-serially correlated models- 
Objective (l) is particularly iiaportar because texrtbooks in this fiold usuaJ ly 
omit a worth while discTission of the underlying population from which a sample oT 
tiiae series observations is assximed to be drawn. Thais, students often fail to t'rasp 
the relationship between the sample that they arti working with and '-he underlying 
population. However, in the case of e^q^er^mental data, the student has much loss 
difficulty in relating the sample to the population. 

The first regression model to be discussed is: ^ = 3OO + .3X+U. In this modcvi , 
Y represents family expenditures on consumer durables, X is feunily incon^ in doliarc, arid 
U is a random eri'or term normally distributed with mean sero and variance as specified 
by the user. 3O0 + .3X is the true coii5>onent of Y, and U is the random coniponent. 
This model reveals that as family income increases by $1, a family will spend an 
additional 30 cents on consumer durables, subject to the random disturbance term U. 

To encourage interaction between user and the computer (or terminal), a number 
of responses are called for, including five YES and NO responses, as the program is 
executed. With regard to the numerical inputs, the user is asked to select the sample 
sise and the standard deviation of the U term. Family income is selected randomly 
by the compu^^^er from a imiform distribution on ohe interval $2000 to $25000, and 
the U terms are generated froia a normal distribution with mean zero and standard 
deviation selected by the user. The computer displays these U terms, the n income 
values, the true component of Y, and finally the generated Y values. Simulations 
may be repeated with different rample sises. Income ranges, and different standard 
deviations. 

Space limitations preclude the displaying of the APL programs in their entirety. 
However, portions of the print otits, beginning with the selection of saaple size 
by the user, are displayed below for sample sizes of 10 and 30, The estimated and 
true regression lines plotted in the print outs should form str?!iight lines, except 
for the approtximation errors in the PLOT function, 
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The second regression model is Y i + wher^ U is *Kjtf serially 

correlated according to the autocorre lation scheme U(t)r; RxU(t-l) + E( t) . ■ " 

the error term at tine t, U(t-l) is the orror term at time t-1, R is a constai.'. 
absolute value less than or equal to 1, and E(t) i;^ an error term nonrially Ui:-- ■ utf.rl 
with mean zero and specified variance. This formula generates the last n-1 U '^-^-r. 
based on an initial valtie for U, called U(l), which is selected randomly from -'-^ 
interval (*250,250). The Y and X variables are identical to those in the i'irc' -k^]. 
The students are asked to type in the standard deviation of E, the value oV 1^^ 1/ 
tipper and lower limits of family income, and the size sample. 

Many real life relationships between two variables correspond to ovr so<:os i 
model and the simulation of observations besed on this model should enable thtf i:'tJent 
to gain valuablti insights regarding the proper use of empirical data. In roal i^y, 
the U term contains the effects of all variables on the dependent variable X exr. f i 
the infltience of the independent variable X. Because in the real world a grea^ 
number of factors influence the dependent variable, the IT term in a regression 
model could be veiy important. 

It can be shown that when the error term U is autocor related bjr the abovr^ 
scheme, the estimates of the regression coefficients by OLS are unbiased, but Ui^^ 
standard errors of these coefficients will be veiy large relative to non^uto- 
correlated U terms. Hence, hypothesis testing and interval estimation concemini'^ 
the regression coefficients may not be reliable. 

Simulations of Y (family expenditures) and X ( family income) observatior;, 

according to our second model are shown below in print outs of progrars based v 

this model. The observations and plottings are displayed in much the same ntari » 

as those of the first print out. For both san^ile sizes of 10 and 30, the valuv 

of R is 1, the standard deviation of E is 500, and the lower and upper limits 

family Income are 2000 and 25000, respectively. 
. 

ERIC 



in ORDE^ TO GBtlERATE A SET OF OBSERVATION'S L'lTfl AVTOCORRELATED U TERHS OF THE 

F0E.-1 UiT)=R^U{T'l)*E(T) YOU WILL BE ASKED TO SUPPLY CERTAIfl liiFORIIATION . PLEASE 
SELECT Af} INTEGER EETWCEti 10 AND 30 EOR THE SAliPLE SZZ3 

0: 

10 

UHAT IS THE L'''VER LL'IIT OF THE FAMILY I^iC0:1E RANGE? 
Dt 

2000 

PLEASE SELECT A NU.'iBER FOR THE UPPER LIHIT OF FAMILY IHCO-HE 
Qt 

25000 

THE 10 lUCOliE VALVES SELECTED AT RASDOll FROM THE lUTERVAL 25000-2000 ARE 16257.7 8888. 

17 361.7 311.2 11252.9 10928.6 lill3i*.8 99 58 21*190. U 5512.1 

THE ERROR TERMS U ARE DETERMINED BY THE FORMULA U(T) =R^UiT-l)i-EiT) . THE I'HTIAL 

VALUE FOR £/» DENOTED UH) AND SELECTED FROM THE lUTERVAL -250 TO 250, IZ "'+7 
PLEASE SELECT A NUMBER BETWEEN 200 AND 700 FOR THE STANDARD DEVIATION OF E 
Ut 

500 

PLEASE SELECT A NUMBER BETWEEN "l AND 1 FOR THE VALUE OF R 
Ui 

1 

THE 10 AUTOCORRELATED U TERMS ARE '47 3U3 994.7 1394.1 1U57.9 2254.7 1952. E 23U7 
2255.65 2999.25 

THE SIMULATED EXPENDITURES ON CONSUMER DURABLES Y ARE FOUND BY ADDING 300+.3X 

TO THE U TERMS AND ARE AS FOLLOWS 5130.31 3309.55 6503.21 9024.36 51U3.77 
5833.28 6493.04 5634.8 9812.77 4952.68 THESE VALUES ARE PLOTTED AGAINST 
FAMILY INCOME ON THE FOLLOWING PAGE 
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It.' o:;^F,^ TO r.EnERATE a set of n 03i;ERVATio:.>s with AUTOcon.ir.LATEo u 7Er:-5 of the 

for:i f/(?)=.7xa(r-iu/?(r) you jill be asked to supply ckrtaiu lUFomiATioa . please 

SELECT AN lUTEGEB BETWEEN 10 AND 30 FOii THE SAi'iPLE SIZE 
30 

WHAT IS THE LOWES LH-HT OF THE FA'AILY I?/CO:iE RANGE? 
Ut 

2000 

PLEASE SELECT A NUMBER FOR THE UPPER LI'lIT OF FAMILY INCOME 
0: 

25000 

THE 30 INCOME VALUES SELECTED AT RANDOM FROM THE INTERVAL 25000-2000 ARE 9856.8 2280.6 

3159. 2 200»48.1 22230.8 12538.6 201 17 .1 20751.9 23833.9 7338.3 881+1+. 8 12872.1 
3 350.1 212»48 . 7 19990 .6 18916. S 13881 .8 9161+. 5 20631+. 6 10588. 2 18711+. 1 
19912.1+ 20257.'+ 20110 .2 16370.1+ 22290 .6 23638 . »4 1 3628 . 8 18»424.3 11839.1+ 

THE ERROR TERMS U ARE DETERMINED BY THE FORMULA i/( D =H>ci/( T-l ) +E( D . THE INITIAL 

VALUE FOR U, DENOTED i/( 1 ) A^D SELECTED FRO!'. THE INTERVAL -250 TO 250 , 15 "Q2.C25 

PLEASE SELECT A NUMBER BETWEEN 200 AND 700 FOR THE STANDARD DEVIATION OF E 

Ut 

500 

PLEASE SELECT A NUMBER BETWEEN "l AND 1 FOR THE VALUE OF R 
Ut 

1 

THE 30 AUTOCORRELATED U TERMS ARE "92 .625 'I9l.il25 66 . 175 1+05 .725 996.925 89t,275 

"190 . 225 2»4 . 725 50 . 22 5 »4 . 925 "338 . 225 "i+78 . 525 21+. 125 239.725 29 . 325 19 .625 
~20 .1 75 '112.025 701+.275 153.925 10 . 025 "l06'+. 375 'l 1+12.925 "j20»4.675 

"1958. 975 ~2»429 . 775 '31i+3 .625 ~2957. 675 ~3 37 3 . 62S "3339. 72S 

THE SIMULATED EXPENDITURES ON CONSUMED DURABLES Y ARE FOUND BY ADDIUG 200*. 2X 

TO THE U TERMS AND ARE AS FOLLOWS 316i4.i+15 792.755 1313 .335 6720 . 155 7966 . 165 
1+955.855 611+1+. 905 65 50 .295 7500 .395 2 506.»4l5 2615.215 3683.105 1329.155 
6911+. 3 35 6326. 505 599»4 . 57 5 »4»4i+i+. 365 2937 . 325 7191+.655 3630 . 385 550»4.205 
5208. 7»45 14961+. 295 5128. 385 3252. 11+5 i+557 .»405 »42»47 . 895 1»4 30 . 065 2»45 3 . 665 
512.095 THESE VALUES ARE PLOTTED AGAINST 
FAMILY INCOME ON THE FOLLOWING PAGE 
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4# AnalVBls of Results * With regard to the simulations of model 1 (pp, 5-8), 
it can be seen that as the sample size increased from 10 to 30, the estimated and 
true regression lines converged. In general, the students also experienced this 
sajne tendency when they executed the program. 

Even more revealing is the comparison of the first plotttngs of both print 
outs (pp# 6 and H), and also the second plottings in these print outs (pp# 8 and 13}* 
The first comparison is between the simulated values of Y and X for each model Tor 
sample sizes of 10, and the second is a comparison between the simulated Y and 
X values for sample sises of 30. It will be noted that the dispersion of the Y 
observations generated from the auto correlated model is much wider than that froci 
the non-autocorrelated model— a result that can be expected on the basis of stat- 
istical -theory, A comparison of the two plottings in the second print out (pp. 11 and 1-3/ 
also reveals that as the sample size increases, the dispersion of the X simulated 
values also increases. 

All 16 students in undergraduate econometrics in the spring of 1973 executed 
these programs enthusiastically. Everyone believed that they learned something 
about econometrics that they failed to leam from the text and lectures and they 
expressed a desire to see greater use made of this technique for instructional 
purposes. The objectives of these two programs, outlined earlier, were largely 
achieved. 

The students mentioned that they learned the following specific things from 
running these programs: (l) The realization that the estimated line does not coii:<;jae 
with the true regression line; (2) As the size of sample increased, these two lin*^:; 
approached one another; (3) A greater appreciation of the role played by the U 
term in regression analysis; (4) The nature of randomness, as illustrated by the 
display of the simulated observations, both graphically and tabularly; and (5) Th*^ 
wide dispersion of the dependent variable as autocorrelatton is introduced into 
regression models. 

The rae:n criticisms of the programs were that they are somewhat lengthy, and t^na'. 
not enough inptit was required by the user. 
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